Conference Proceedings

Taking Risks with Confidence

Rodger Benham, Ben Carterette, Alistair Moffat, J Shane Culpepper

Proceedings of the 24th Australasian Document Computing Symposium (ADCS '19) | ACM Press | Published: 2019

Abstract

Risk-based evaluation is a failure analysis tool that can be combined with traditional effectiveness metrics to ensure that the improvements observed are consistent across topics when comparing systems. Here we explore the stability of confidence intervals in inference-based risk measurement, extending previous work to five different commonly used inference testing techniques. Using the Robust04 and TREC Core 2017 NYT corpora, we show that risk inferences using parametric methods appear to disagree with their non-parametric counterparts, warranting further investigation. Additionally, we explore how the number of topics being evaluated affects confidence interval stability, and find that mor..



Grants

Awarded by Australian Research Council


Funding Acknowledgements

The first author was supported by an RMIT VCPS Scholarship. The third and fourth authors were supported by the Australian Research Council (project DP190101113). Christina Knudson assisted by locating some relevant material.